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Chapter 0: Introduction 


Let me be very clear here: ed(1) is the standard Unix text editor. 

Dennis Ritchie, co-creator of Unix, declared it so. Who are you 
to argue with someone who can write a complete operating system 
without using a glass teletype?’ 

Many younger sysadmins naively hoist their pennants to defend 
overblown, overwrought, overdesigned text editors like ex, vi, or even 
the impossibly bloated nvi. A few are so lost as to devote themselves 
to turgid editors meant for mere users, such as vim and Emacs. ‘This 
way lies not only appalling sysadmin skills, but an absence of moral 
fiber. As a sysadmin, you must have enough brain power to remember 
what you typed, to hold your own context in your head, and to truly 
commune with the machine on a deep and personal level. 

Most Unix hosts exist to perform real work, such as supporting 
databases or serving web pages. When you specced out the system's 
memory, processor, and disk I/O, you considered the amount of 
hardware needed to run those tasks. Any system resources above that 
are intended to support atypical peak loads, not some pathetically 
overengineered text editor. When you suck up extra kilobytes—or, 
worse, megabytes—to run a fancy editor, you steal from the host. 

Sysadmins dependent on something like vim? They actively harm 


systems. Do not trust them. 


1 You know, a glass teletype. That toy that kids keep calling a 
“monitor,” even though we all know monitors are reference speakers 
used in audio production. 


Ed Mastery 


For a random user, there’s no shame in being unable to use ed. 

I’m not qualified to become a Navy SEAL. I’m not even qualified to 
become the kind of seal that lies on a beach and barks. That’s okay. The 
world is full of computers. If you cannot handle the undiluted glory 
that is ed, use one of those friendly pointy-clicky systems. You’re who 
they exist for. I recommend those tablet computers that don’t have a 
keyboard, though, as those bright, colorful icons are very helpful to 
the non-educatable. 

It's not that using ed(1) is the pinnacle of systems administration 
achievement; it’s the minimum requirement. You must be this 
competent to manage this computer. 

If you're reading this book, it’s because you want to join the elite. 
You want to count amongst the Navy SEALS of information technology. 
You want to be a real sysadmin, not one of those posers that need 
fancy toys like a “monitor” Yes, I own monitors, but only because these 
wimpy modern laptops don't come with built-in line printers. 

Real sysadmins not only can work, they truly thrive on their wits 
and line printers alone. We welcome anyone who proves worthy of 
joining us. 

This book covers standard ed. Many operating system developers 
can't resist the urge to add additional features to ed. That’s how the 
tragedies of ex and vi happened, after all. While these developers’ 
urge to become part of something as momentous as ed is perfectly 
understandable, ed is feature complete. This book won't cover OS- 
specific extensions, such as Linux’s x and y commands and FreeBSD’s 
encryption support.’ 

I should warn you, though: some sysadmins object to this book. 


They sincerely believe that the best sysadmins learn from studying 


2 Why add encryption to ed? That’s what crypt(1) is for. 
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the operating system source code or, at worst, reading the man page. 
Many of those folks forget their own youth, though. You can't learn 
computing on your own. Even the most sagacious Unix admins had 
mentors. 

In this bleak age when search engine algorithms dredge up 
shoddily-assembled HOWTOs and pass them off as authoritative 
documents, providing for the next generation of sysadmins is one of 
the most vital tasks we face as a profession. I don’t want the servers 
storing my retirement funds managed by so-called sysadmins who 
struggle against the pomposity of ex(1). Study the ed(1) source with 
this book at your side, in combination with a good C reference, 
such as the 1978 edition of “The C Programming Language” by the 
esteemed Brian Kernighan and Dennis Ritchie. If they could write 
a complete operating system using a line printer for output, you can 
handle your itty-bitty tasks. 

Only a jerk would belittle you for attempting to transcend your 
limitations and become a proper sysadmin. Sadly, due to social stigma, 
I can't blame you if you hold owning this book a precious secret. If 
necessary, you can get instructions for making a protective book cover 
out of a brown paper bag from your nearest Gopher site. 

Real sysadmins understand how the computer works. We 
understand that the long-sought “What You See Is What You Get” 
(WYSIWYG) editor is nothing but a pernicious lie from the marketing 
department. We know in our bones that a file ending in . ext isn't 
necessarily a text file—indeed, that a text file can have any name. 
Transcending these deceits is a necessary step to achieving true 
sysadmin mastery. The standard Unix editor does not pretend to be 
anything other than an engine for manipulating text. 


And what a magnificent engine it is. 


Chapter 1: Ed Essentials 


At its heart, ed(1) is a text editor. It was written when computers didn't 
have monitors, though. Computers could write data to tape or to a 
printer. The printers of that day didn’t wastefully run whole sheets of 
paper at a time, though; like ticker tape machines and typewriters, 
these line printers printed a single line at a time. 

The sysadmin would enter commands, printing results only when 
necessary. Or, if you prefer, a clattering printer announced to everyone 
within hearing every time you lost your train of thought. Real 
sysadmins can concentrate on their work and remember context. 

Ed is a line editor. It works on lines of text, as printed by a printer. 
It'll work perfectly well on one of those new-fangled “monitor” things, 
if you're unlucky enough to be stuck with one. 

We'll begin exploring ed with a few basic tasks: starting the 


program, considering modes, saving, and quitting. 


Starting Ed 


Run ed(1) by entering ed and the name of the file you want to edit. 
Here I want to edit a text file containing my favorite poem, Lewis 


Carroll’s Jabberwocky. 


$ ed jabberwocky. poem 
963 


The number 963 does not appear anywhere within the poem. Why 
is it here? 
When you edit a file, ed reads the whole file into a memory buffer. 


Like everything else in memory, the buffer disappears when the system 


ce 


Ed Mastery 


shuts down or the program exits. When you edit the file, you change 
the copy in the buffer. At some point you might overwrite the original 
file with the buffer, also known as “saving your changes.’ You could 
write the buffer to a different file, or a new file. When you start ed, 

it tells you how many bytes it read into the buffer. Remember, real 
software uses no more memory than necessary. 

Where's the text? This is your file. Don’t you already know what's 
in it? You dont really want to waste however many lines of paper auto- 
printing every file you open, do you? Next time try using head(1) to 
peek at the contents of your file before loading it into the editor, or 


flipping back through old printouts to identify the file. 


Commands and Modes 


Ed is command-driven, which is a fancy way of saying it’s a text 
manipulation shell. You'll issue commands at the ed command 


prompt. Try it now by hitting ENTER. 


Ed has one error message, a single question mark. It means “I 
don't understand you.” When you get the error message, examine the 
command you typed and figure out what you did wrong. ENTER is 
not a valid ed command. All ed commands are letters, numbers, and 
standard keyboard symbols.’ 

The h command asks ed to explain the last error. With verbose 


error messages off, try ENTER again, then use h to explain the error. 


? 
h 
invalid address 


2 Ed is UTF-8 compliant, of course. While it can handle emoji, 
doing so imperils your immortal soul almost as much as using nano. 
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Everything's perfectly clear now, right? Probably not, but we'll 
discuss addresses in Chapter 10. 

If you have an unlimited ink budget, you could enable verbose 
error messages. Enter H by itself to toggle verbose error messages on 
and off. 


H 


Ed echoes your command to the printer. Now try ENTER again. 


? 
invalid address 


Turning on verbose error messages by default wastes both paper 
and reading time, but we've all had that experience where we stare at 
our command for far too long trying to figure out what we did wrong. 
We all need help to learn. 

Another command you might find useful is turning the command 
prompt on and off. The ed command prompt is a single asterisk. It’s 
useful when you're reviewing old printouts and need to identify your 
commands, or if you've got a fancy system with a video terminal. Enter 


p to turn the command prompt on or off. 


P 


If you're using a video terminal, you'll see the command prompt 
straight away. Printer users will get the command prompt in the line 
that displays their next command. 

Enter ed commands individually. Some of those commands, such 
as for regular expressions (Chapter 101) can be very complicated 
and contain many characters, but each command needs its own line. 
Here I try to simultaneously turn on verbose error messages and the 


command prompt. 


Ed Mastery 


I’ve confused ed. Let’s get some detail. 


h 
invalid command suffix 

Ed sees that I’ve used the command and added the P suffix. 
An ed command suffix is much like a command-line argument; it 
modifies the command. The command doesn’t take p as a suffix, so 
ed spits an error in your face and declares you unworthy. Instead, enter 


each command separately. 


$ ed jabberwocky. poem 
963 

H 

P 


You now have verbose errors and the command prompt. As 
youre just learning ed, many of the examples in this book are run 
with verbose errors and with the command prompt. It wastes ink, but 
youre worth it. 


Leave ed with the g command. 


Set an alternate prompt when starting ed, using the -p flag. 


# ed -p# jabberwocky.poem 

You might need to quote your prompt, especially if you put a space 
between -p and your prompt. Prompts that have special meaning in 
the shell, such as * and >, will cause problems otherwise. For your 
own edification, tryed -p * jabberwocky.poem and see what happens. 


Otherwise, quote your prompt. 


$ ed -p ‘>’ 
963 
~ 


jabberwocky .poem 
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All commands get run in command mode. Adding text requires 


input mode. 


Switching Modes 


While command mode is for issuing text editor commands, input 
mode lets you add text to a file. Here I open the empty file todo. It’s an 


empty file, so it uses zero bytes. 


$ ed todo 
0 


Ed supports three different ways to get into insert mode: append, 
insert before current line, and change current line. We'll discuss 
changing lines in Chapter 10. 

Enter append mode with the a command. Append mode puts the 
cursor at a new line right after the current line. As this is an empty file, 
it goes straight to the end. Anything you type gets added to the file. 
Hit ENTER to go to the next line. Here I add today’s to-do items to the 
end of this file. 

*a 
buy groceries 
restrain capitalism 
clean the rat cage 
If I can accomplish all that, it'll be a good day. 
Leave insert and append mode, returning to command mode, by 


entering a period on a line all by itself. 


Insert text before the current line with the i command. This 
requires choosing a current line, which we'll discuss in Chapter 10. 
Similarly, you can choose an existing line and change it with the c 
command. 

If you make a mistake in your command, CTRL-C aborts it and 


returns you to a command prompt. 
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I now have my to-do list in the buffer. Writing it to disk would be a 
good idea. 


Saving and Exiting 


Write the buffer back to the file with the w command. (We'll discuss 
saving to alternate files in Chapter 100). Ed prints the number of bytes 
written. After saving, exit ed with the g command. My complete ed 


session to create my to-do list would look like this. 


$ ed todo 

0 

P 

*a 

buy groceries 
restrain capitalism 
clean the rat cage 


ew 
53 
*q 

While you can't normally combine commands, you can combine 
the write and quit command. 
*wq 

If your brain slips and you don't save the file before exiting, ed 
complains. 


*q 
? 


Let's see what it’s complaining about. 


* h 
warning: file modified 


I didn't save the file. Enter the g command a second time to throw 
away your changes and exit without saving. You can also use a 9 to 
immediately exit without saving. 


Now let’s consider line addressing. 


10 
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In ed terms, an address is a line number in a buffer. Your commands 
affect an address. Many commands have effect on an address-by- 
address basis, such as “delete address” or “insert after address 5” Ed 
also has a concept of the current address, which is the default location. 
You can tell ed that you're working on line 5, and then change, append, 
or otherwise mangle that position in the file. Ed makes heavy use of 
addresses. Selecting and setting addresses is a vital part of working 
with ed. 

We'll demonstrate addresses using a file containing Carroll’s poem 
Jabberwocky. While I'm certain most of my readers, being decent and 
worthwhile people, are not only familiar with this poem but have it 
memorized, Appendix A includes a copy for those of you suffering 
from an appallingly deficient liberal arts education. It’s thirty-four 
lines long, counting the blank lines between paragraphs, so it contains 
thirty-four possible addresses. A few ed commands (but not all) can 
accept an address of zero, meaning “before address one.” 

Many ed commands expect you to put the addresses to be affected 
before the command. A command like 4a or 3, 5- means “on these 
lines, run this command.’ This swaps the most vital context out of 
your vulnerable brain and into the computer as quickly as possible. 
You know you need to delete lines, but your mind is most likely to 
forget the specific lines so it’s best to set those immediately. 

An address on its own is a single address. Two comma-separated 
addresses represent start and stop addresses, and include those 


addresses as well as everything between them. 
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When you set an address, ed prints the line. For clarity in this 


section, I set the p and s commands when starting ed. 


Setting Your Address 


When you first open a file, your current address is the last line of the 
file. Send the contents of the current address to your printer (or, if 


youre one of the fancy folks, to your monitor) with the p command. 


And the mome raths outgrabe. 


This is the last line of Jabberwocky. The special address s always 
refers to the last line of a file. 

To set the address to a line number, enter that number. I want to 
see line 6 of Jabberwocky. 
*6 
“Beware the Jabberwock, my son! 

When you change the address, ed automatically prints the line at 
that address. 


Advance the address one line at a time with the + command. 


wy 
The jaws that bite, the claws that catch! 
wy 
Beware the Jubjub bird, and shun 
wy 


The frumious Bandersnatch!” 


To forward multiple lines, give the + command and the number of 
lines you want to forward. This is the last line of the stanza, and I want 
to skip the blank line that follows,* so I advance two. 


*42 
He took his vorpal sword in hand; 


4 Remembering that the file has blank lines is the sort of thing 
you must do if you wish to be worthy of ed(1). 


12 


Chapter 10: Addresses 


Walk backwards through the buffer using the - or * commands. 
(GNU ed has disrespectuflly dropped the * command, leaving you 


with only -.) Add a number to move back that many lines. 


The previous line is blank—but we knew that. It’s why we skipped 


ahead. Set the address back one more. 


The frumious Bandersnatch!” 
The Bandersnatch is, indeed, frumious. 


Finding Your Address 


The equal command (=) prints an address. It defaults to the last 
address of the buffer. 


~ 


34 


This buffer is 34 lines long. 
The period (.) represents the current address. To see your current 


address, enter a period and the equals sign. 


9 


Our address is 9, so we're on the ninth line of the file. 

The address s represents the last line of the file. While it’s most 
commonly used to jump to the end, you can combine it with the 
equals command to explicitly list the number of lines in the file. 

* $= 
34 
* $ 

I remembered correctly; this file is 34 lines long. And we've now 


moved to the end. 


Ed Mastery 
Address Ranges 


Ed can perform operations on multiple lines simultaneously through 
two special addresses, the comma (,) and the semicolon (; ). 

The comma (also available as %) represents the whole file. It 
doesn't do anything by itself—you can't set the current address to “the 
whole file,” but combined with another command you can perform 
actions on the whole file at once. Here I combine the comma with the 


p command to print the entire file. 


*,p 
‘Twas brillig, and the slithy toves 
Did gyre and gimble in the wabe: 
All mimsy were the borogoves, 
And the mome raths outgrabe. 


“Beware the Jabberwock, my son! 

The jaws that bite, the claws that catch! 
Beware the Jubjub bird, and shun 

The frumious Bandersnatch!” 


Not only does this waste a lot of paper, everyone within hearing 
distance of your line printer will know that you're either too lazy to go 
through the earlier parts of the roll to find a copy, or you hate trees. 

The semicolon represents the current address to the end of the 
file, just like .,$. Suppose you want to display the last nine lines of the 
file—two stanzas of the poem. The file is 34 lines long, so you want 
lines 26 through 34. Set the current address to 26 and send the rest of 
the file to the printer. 


*26 
“And hast thou slain the Jabberwock? 


“5p 

“And hast thou slain the Jabberwock? 
Come to my arms, my beamish boy! 

O frabjous day! Callooh! Callay!” 
He chortled in his joy. 


14 
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‘Twas brillig, and the slithy toves 
Did gyre and gimble in the wabe: 
All mimsy were the borogoves, 
And the mome raths outgrabe. 


Yes, you see line 26 twice. You're clearly a tree-hater to print all 
this, though, so it furthers your goals. 
If you change your address to a range of addresses, your current 


address becomes the last address of the range. 


Relative Addresses 


You can also access lines via relative addresses. A relative address is a 
certain number of lines before or after the current address, indicated 


by a plus (+) or minus (-) sign. There's lots of good reasons to do this. 


*30 


I set my address and ed shows me the contents—a blank line. 


Where exactly in the poem am I? 


*-2,+2p 
O frabjous day! Callooh! Callay!” 
He chortled in his joy. 


‘Twas brillig, and the slithy toves 
Did gyre and gimble in the wabe: 


Ah, that’s where. 

Yes, I could do the same thing with a command like 28, 32p. 
Relative addresses are just one of ed’s many user-friendly 
conveniences. 

A single minus or plus sign on its own means one. The command 
-p means “print the previous line,” while -, +p gives one line of context 


on either side of the current address. 


Ed Mastery 
Scrolling 


The scrolling feature is how ed condescends to assist those folks who 
insist on relying on a “monitor.” The z command tells ed to scroll from 
the next address to as far as the terminal allows. 

The standard terminal can display 24 lines at a time. Let’s see how 
scrolling works. Start at the top of Jabberwocky. 


*Y1 
‘Twas brillig, and the slithy toves 


Ed prints the line at this address. Now scroll by running z. 


*zZ 

Did gyre and gimble in the wabe: 
All mimsy were the borogoves, 

And the mome raths outgrabe. 


Ed starts with line two, because you've already seen line 1 and 
surely you can remember what you just read, right? The output 


continues until it fills the terminal, twenty-four lines later. 


He left it dead, and with its head 
He went galumphing back. 


Want to see more of the buffer? Hit z again. 

To start scrolling at a particular line, put the address before z. 
Maybe you want the scroll to include line 1. 
*1z 


‘Twas brillig, and the slithy toves 
Did gyre and gimble in the wabe: 


To scroll a reduced number of lines, enter the number of lines you 
want to show after z. Here I scroll the third verse of Jabberwocky. Each 
verse is four lines long followed by a blank line. We want addresses 11 


through 14, also known as 11 and the three addresses after 11. 
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*11Z3 
He took his vorpal sword in hand; 

Long time the manxome foe he sought-- 
So rested he by the Tumtum tree 

And stood awhile in thought. 


Verse 3, suitable for framing! 


The z command changes the current address to the last line shown. 


Displaying Addresses 


When you're working on a long file, it might make sense to print a 
convenient master copy of the file with the address of each line. The 

n command shows each line prefixed with its address. You can use n 
with selected lines by putting the desired start and stop address before 


the command, separated by a comma. 


*11,14n 

11 He took his vorpal sword in hand; 

12 Long time the manxome foe he sought-- 
13 So rested he by the Tumtum tree 

14 And stood awhile in thought. 


Ed’s current address becomes the last address shown, or 14 here. 


Or you could print the entire file, conveniently numbered. 


‘Twas brillig, and the slithy toves 
Did gyre and gimble in the wabe: 
All mimsy were the borogoves, 
And the mome raths outgrabe. 


RWN-PH # 


Only do this once you’re done rearranging your files, or you'll 


waste paper. 
Viewing Trailing Space 
One annoyance with text files is that blank spaces can mess up a file. 


You'll notice extra blank spaces in the middle of your text, but those 


extra spaces at the end of a line are tricky. You don’t want to have to 
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watch your printer to see if the head carries way over to represent the 
blank spaces, after all. Use the 1 suffix after n or p to put a s symbol at 
the end of each line. Here I view the actual spaces in the file containing 
my to-do list. 

* snl 


1 have bath $ 
2 with soap$ 


There's an extraneous space at the end of line 1. 


Bookmarks 


Scrolling through a long file can be a pain, especially if you're adding 
and removing lines. Ed allows you to create bookmarks to easily 
return to a location in the file. Each bookmark is named after a single 
lower case letter, and you'll use that letter to return to the bookmark. 
Suppose you want to place a bookmark at the beginning of each 
verse of Jabberwocky. You can't number bookmarks, but you can call 
each of the seven verses a, b, c, and so on, up to g. Use the k command 
to assign a bookmark. 
* 1 
‘Twas brillig, and the slithy toves 
*ka 
*6 
“Beware the Jabberwock, my son! 
*kb 


Every five lines you'll create a new bookmark. 
To return to a bookmark, use the single quote and the bookmark 


letter. 


* 7b 
“Beware the Jabberwock, my son! 


Now that you can dance through your files, let’s mangle some 
buffers. 
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Ed contains everything you need to change existing text files. As a 
simple example we'll muck with my to-do list. You can insert lines, 
join lines together, shuffle text around, and generally proceed with 


your day without worrying about all kinds of menus and knobs. 


Changing Lines 


A key function of a text editor is changing existing text. In ed this 
means selecting (by address) the line you want to edit, and changing it. 
Here's my to-do list. 
*,p 
buy groceries 
restrain capitalism 
clean the rat cage 

Line two is not correct. I need to change it, and I need to adda 
new item between lines two and three. Set the address to line 2. 
*? 
restrain capitalism 

Ed prints the address you set. Yes, that’s the line I want. Go into 
insert mode with the change (c) command. Unlike plain insert, c 
overwrites the existing line. 
ct al 

Type the new line two, hit ENTER, and add another line. Leave 


insert mode with a period on a line by itself. 


obliterate capitalism 
rainbow ponies all around 


Ed Mastery 


If I wasn’t completely confident in my ed skills, I could double- 
check the buffer. 
*,p 
buy groceries 
obliterate capitalism 


rainbow ponies all around 
clean the rat cage 


Yes, that’s right. I use the wq command to save my to-do list. 


Inserting Amidst the File 


The append (a) command adds a line after the current address. The 
insert before (i) command inserts a new line before the current 
address. Which should you use? Whatever does what you want to 
accomplish. I need to insert two new lines at the very beginning of the 
file, so I set the address to 1 and use the i command to put my new 
lines before that address. While I could give the 1 command and then 


i, it’s simpler to give the address with the i command. 


*Li 
invest in guillotines 
flee to private island 


My to-do list now has six items. 


Moving Text 


I work down my to-do lists from top to bottom. This means ordering 
is important. Once my morning tea delivers consciousness to my 
brain, I'll realize that I can’t very well purchase groceries after fleeing 
to the private island; I must take food with me. I’m pretty sure that I 
added “flee to private island” before “buy groceries,’ however. 
*1,3n 

invest in guillotines 


1 
2 flee to private island 
3 buy groceries 
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Yep, I messed up. I should have added my new items after line 1, 
rather than inserting them before. Fortunately, ed offers the m (move) 
command. 

The move command uses addresses both before and after the 
command. The addresses to be moved go before the command, on the 
left side. The lines get inserted after the address on the right side, after 
the m. You can use zero as the right side address, to put those lines at 
the top of the file. 

*3m1 

What was address 3 (buy groceries) should now appear after 

address 1 (invest in guillotines). Did I screw anything else up? I'll show 


the list with addresses, because I like numbers. 


1 invest in guillotines 

2 buy groceries 

3 flee to private island 

4 obliterate capitalism 

5 rainbow ponies all around 
6 clean the rat cage 


No, hang on. I won't travel with dirty rats, but I must get groceries 


before cleaning the cage. Move line 6 to after line 2. 


*6m2 

* on 

di invest in guillotines 

2 buy groceries 

3 clean the rat cage 

4 flee to private island 

5 obliterate capitalism 

6 rainbow ponies all around 


This still has problems. I know me. Investing, also known as 
“futzing around on this new-fangled Web contraption,’ is much more 
amusing than tedious chores. I really need to move items 2 and 3 to 
the top of the list. As that’s before address 1, I use a destination address 
of 0. 
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*2,3m0 

*in 

1 buy groceries 

2 clean the rat cage 

3 invest in guillotines 

4 flee to private island 

5 obliterate capitalism 

6 rainbow ponies all around 


Yes, much better. 

When you move lines to after a later line, ed uses the current 
line numbers as of the time you run the command. Moving line one 
to after line twelve shifts all the lines up one, yes. What was line one 
becomes line twelve; it doesn’t recursively start renumbering lines. 
Future move commands will use the new addresses. 

When I use multiple move commands, I start from the end of the 


buffer to reduce the amount of math I do. 


Deleting Lines 


This is an ambitious list. Can I really get all of this done today? 

Realistically, I know that once I start dinking around on the 
computer I probably won't get out of my comfy chair for the rest of the 
day. It’s best that I get rid of any items that would let me relax in front 
of a printer. Use the a command to delete addresses. Here I delete line 
3 


*3¢d 


I could specify multiple lines by separating them with a comma, 
just as in moving lines. 

The buffer now has a new line three, of course. When deleting 
multiple lines at different parts of the file, it’s best to start at the end 
and work backwards. When you delete an address, the address of all 
later lines shifts up one. Working backwards reduces the amount of 


brainpower you waste on renumbering. 
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Undo 


I'm as guilty of laziness as the next person. I need to develop the self- 
discipline to get up from the computer when I have stuff to do. That 
deleted item needs to go back on the list. Use the u command to undo 
your last action. 
*u 

Ed has one level of undo. It can only affect the very last command 
you ran, including undo. To redo, undo the undo. Hitting u three 
times undoes the undone undo. I can delete and re-insert this line 
forever, but undo will never go back as far as reversing my line 
rearrangements. That’s more than one command back. 

Regular expression operations (see Chapter 101) that affect 
multiple lines are a single command. You undo and redo such changes 


as a whole. 


Inserting Text from Files 


I perform certain tasks at the beginning of every day. For my 
convenience, I’ve created a separate file containing those items. 
The r command lets you read from another file. Here I read the 
file usualtodo and put it after address 0. That file gets added at the 
beginning of the buffer. 


*Or usual todo 
21 


21 bytes get read. What does my list look like now? 
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“on 

1 have bath 

2 with soap 

3 buy groceries 

4 clean the rat cage 

5 invest in guillotines 

6 flee to private island 

7 obliterate capitalism 

8 rainbow ponies all around 
Much better. 

Joining Lines 


Hang on, my list is messed up. The first two items should be one. 
Has some newbie playing with a bloated editor like ex(1) inserted a 
carriage return into my file? I'll execute the flunky responsible later, 
but for now, let’s join those lines with the ; command. The addresses 
you want to join get deleted from the buffer and replaced by a single 
line containing all their contents. 

The ; command uses addressing syntax similar to n. You might 
find it useful to double-check the line numbers you want to join before 


nailing them together. It’s much easier to join lines than break them 


apart. 
*1,2n 
1 have bath 
2 with soap 


Yes, those are the correct lines. Merge them into one line and 
double-check the results. 


*1,2j 

*1,2n 

1 have bath with soap 
2 buy groceries 


Much better. Ed’s current address changes to that of the merged 


line—in this case, 1. 
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Note that the previous line one had a space at the end, as shown in 
Chapter 10. Without that space, the words would have run together. 
Youd have to change that line to fix the missing space. 

Splitting lines requires using a substitution, discussed in Chapter 
110. 


Copying Lines 


This is going to be a long day. I’m pretty sure I'll need a bath at the end 
of it. Worse, the pet rats will be indignant after being hauled off to a 
private island. Their cage will need cleaning. Rather than laboriously 
typing out those items at the end of the to-do list, though, Pll copy 
them using the t (transfer) command. Put the address (or range of 
addresses) you want to copy before the command, and the destination 
after. The t command leaves your address at the last line copied. 

I just re-addressed the file by joining the first two lines, so I 
double-check the early addresses. 


*1,3n 

1 have bath with soap 
2 buy groceries 

3 clean the rat cage 


Right. I need 1 and 3 copied to the bottom of the list. I want to 
go to bed properly clean, so I start by copying “clean the rat cage” 
(address 3) to the very end of the file. 


*3t$ 


Now I can copy address 1 to the end of the file, putting my bath 


after cleaning. 
*1t$ 


How did this come out? 
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have bath with soap 

buy groceries 

clean the rat cage 

invest in guillotines 
flee to private island 
obliterate capitalism 
rainbow ponies all around 
clean the rat cage 

have bath with soap 


WOONDAUBWNE 


My to-do list is done. When I’m finished taking on the world, at 
least Pll be clean. 

While I’ve copied addresses to the end of the buffer, you can put 
them anywhere. One interesting possibility is copying the line to 
right after the current address, using the period (.). This doubles the 
line, and leaves your address set at the second copy. You can perform 
transformations on that duplicated line while not touching the 


original, as we'll see in Chapter 110. 


Appending Lines to Another File 


Use the w command to append addresses in your buffer to another file. 
Put the lines you want to include before the command, and the name 
of the file to be added to at the end. 

Nobody ever finishes everything on their to-do list. Here I copy 


what I didn’t finish onto the end of tomorrow’s list. 


*5,9W tomorrow- todo 
110 

Here’s hoping tomorrow is more productive. 

Simple editing and rearranging suffices for a to-do list. But 
sometimes you'll want your text editor to interact with the operating 


system. We'll discuss that next. 
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Opening an existing file is great, but what if you want to create a new 
file? Or maybe you're editing one file and want to switch to another? 
Perhaps you want to pull the output of a command into a file? Ed lets 
you do all this and more. 

We'll start with file manipulations, and then proceed to 


commands. 


The Empty Buffer 


We've only started ed by editing an existing file. You can run ed 
without a file, though. 


$ ed 
H 
P 


Ed gives us an empty buffer. Let's fill this void. 
wy 
? 
invalid address 

What is it complaining about here? 

Remember that the insert (i) command inserts a line before 
the current address. This is an empty buffer, so what is the current 
address? While you could make sensible guesses, ed will tell you if you 
ask. 
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Addresses are all positive. There's nothing in this file to insert a 
new entry in front of. 

We can append, though. Let’s create a couple lines of text. 
*a 


once upon a midnight dreary 
while i pondered weak and weary 


There’s some text; now let’s save it to a file. 


* Ww 
? 


no current filename 
What fresh inferno is this? 


The Default Filename 


Ed doesn’t know what file you’re working on unless you tell it. It 
doesn't blindly assume that you even want your buffer backed to a 
file at all; perhaps this buffer is security-sensitive and should remain 
entirely in memory. When you edit an existing file, ed assumes that 
you'll want to save any changes to that same file, but lets you write to a 
different file by changing the default filename. 

The default filename is the name of the file ed writes the buffer 
to. Opening a file assigns the default filename to the name of the file, 
exactly as youd expect. When you run ed without a filename, though, 
the default filename is blank. Use the command to set the filename. 
Here I take my buffer of badly typed poetry, assign it a filename, and 


save it. 
*f raven.poem 
raven.poem 


* wq 
60 
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When I want to mistype more of this poem, I can open it by 
filename and save myself the trouble. 


The w command can also set the default filename, if you give it one. 
*w empty 
0 

The default filename lets you save your buffer with alternate 
filenames. Lesser text editors implement this as “Save As” 
functionality. Suppose I want to redo the way Jabberwocky is 
formatted. It’s likely that Pll damage the text somehow, or wind up 
with something that looks worse than the original, so I want to work 


on a copy of the file. I change the default filename and save. 


*f jabberwocky-hackedup.poem 
jabberwocky-hackedup. poem 
ew 
963 
I can now muck with my copy and not damage the original file. 
Ed expects you know which files already exist on your system. If 
you use w to write to an existing file, it overwrites what’s in that file. To 


add to the end of a file rather than overwrite its contents, use w. 
Switching Files 


Why should you leave ed to edit another file? Switch the file you're 
working on with the e (edit) command. 

Suppose I’ve been mucking with the way Jabberwocky is formatted 
in an effort to make it more visually appealing, and need to add an 
item to my to-do list. Use the e command and the desired filename to 


switch files. 

*e todo 

? 

warning: file modified 
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Ed is warning me that I’ve changed my original file and haven't 
saved my changes. I must either save my changes with w, or use e a 
second time in a row to say “yes, throw away my buffer and open the 


new file.” Here I throw away my buffer. 


*e todo 

146 

*a 

fix Jabberwocky formatting 


*w 
173 
A senior sysadmin intending to discard the modified buffer will 


use £ to skip the confirmation step. 


*E todo 
146 


My to-do list is now updated. 
Saving Part of the Buffer 


Maybe you want to save a chunk of your current file to a different 
file. You can do this by giving the write (w) command addresses and a 
filename. 

I have my to-do list open. My file containing the template of my 
daily to-do items, that I normally put at the beginning of each day, is 
messed up. I want to send the first item on my to-do list to that file, 
overwriting the corrupted version with the corrected version. I give 


the addresses, the w command, and the name of the file. 


*Iw usual todo 
20 


This writes address 1, the first line, to the file usuaitodo. It’s twenty 
bytes. 

Or, suppose I’m editing Jabberwocky. I want to write verse two to 
its own file. Addresses six through nine contain verse two. I give that 


address range, the write command, and the filename. 
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*6,9w verse2.poem 
139 
I get the 139 bytes of verse two in the file verse2. poem. 
Writing part of the buffer to a file does not change your current 


address. 


Shell Escapes 


You'll frequently need to grab information from the underlying 
operating system: an IP address, a file name, something. Leaving the 
editor to look up a detail, only to restart the editor and recover your 
current address, is not only annoying—it’s inefficient. That’s where 
shell escapes come in. 

A shell escape runs a single command outside of the ed shell, 
displays the output, then returns you to editing. This displays the 
command output. Ed uses the exclamation point (!) to trigger a shell 
escape. 

Suppose I’m writing the host’s sysadmin documentation, and I 
need to know the current IP address. I don't want to see all the virtual 
interfaces, only em0. Get this information with ifconfig(8).° 
*! ifconfig em0 
em0: flags=8843<UP,BROADCAST , RUNNING, SIMPLEX,MULTICAST> mtu 1500 

lladdr 08:00:27:03:eb:10 

index 1 priority 0 Illprio 3 

groups: egress 

media: Ethernet autoselect (1000baseT full-duplex) 


status: active 
inet 203.0.113.209 netmask OxfffffFF00 broadcast 203.0.113.255 


This host’s IP address is 203.0.113.209. I can now reference that in 
my document. 
Sometimes you want to repeat a shell escape. Ed remembers the 


last shell escape you ran, and can repeat it with the ! ! command. 


5 Real Unix ships with ifconfig. And ed. 
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ane 
ifconfig em0 
em0: flags=8843<UP,BROADCAST , RUNNING, SIMPLEX,MULTICAST> mtu 1500 


It first repeats the command, in case you forgot. 
Sometimes you'll want to read the output of a command into a file. 
The r command can read the output of a shell escape into your file, 


much as it lets you read another file into your buffer. 


*r lifconfig em0 
267 

It's read in 267 bytes. Check the contents of your file and you'll see 
the command output 


ro 


“on 

1 em0: flags=8843<UP,BROADCAST ,RUNNING, SIMPLEX,MULTICAST> mtu 1500 
2 Tladdr 08:00:27:03:eb:10 

3 index 1 priority 0 Ilprio 3 


Ed normally appends output from shell escapes to the end of 
the buffer. If you want to put that output elsewhere, give the address 
you want to insert after before the prompt. Suppose I have a file that 


contains two lines. 


This is ifconfig 
That was ifconfig 


I want to insert my ifconfig output between them, or after address 1. 


*Ir lifconfig em0 
267 
If you lack confidence, check the buffer. 
*\p 
This is ifconfig 
em0: flags=8843<UP, BROADCAST , RUNNING, SIMPLEX,MULTICAST> mtu 1500 


That was ifconfig 


Yes, it’s in the correct place. 
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Send toa Program 


Reading from a shell escape is nice, but you can also send to a 
program’s standard input. This lets you feed your buffer or parts 
thereof into other programs. Perhaps I need to count how many words 
are in the current file, so I can see if I met my daily word production 
target. Much as ed(1) is the standard text editor, wc(1) is the standard 


text counting tool. Write the buffer to a shell escape. 


*w lwe 
34 166 963 
963 
This file has 34 lines, 166 words, and 963 characters. The last line 
is from ed, declaring how many bytes it wrote. It should match the 
number of characters reported by wc(1), of course. 


166 words today? I better get writing! 
Starting with an Escape 


Perhaps the next file you need to create and edit needs to start with the 

output of a shell escape. You could save your work, exit ed, restart ed, 

and run your shell escape. That’s perfectly legit, but it lacks elegance. 
The e command lets you create a new buffer pre-populated with 

the output of a shell escape. This does not change the default filename, 

however. You'll want to follow up with setting a new file to save to, 

either by using £ to set the filename or w and a filename to save the 

new buffer. 

*e lifconfig 

1049 

*f ifconfig-full.txt 

ifconfig-full.txt 

sw 

1049 
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The first command executes the shell escape ifconfig and 
reads in the output. The second, £, sets the default filename to 
ifconfig-full.txt. Finally, w writes the file to its new filename. 

If you don’t want to use the e command, you can use a shell escape 


when starting ed. Complex commands will need quoting. 
$ ed !’ifconfig em0’ 


You can now fold, spindle, and mutilate files as you wish. Now let’s 


unlock the power of ed with regular expressions. 
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A regular expression, or regex, is a string that defines a pattern of 
text. It's most commonly a string of text meant to match another 
string of text. Ed can scan its buffer for text that matches the regular 
expression, either pointing it out for your consideration or performing 
automatic transformations on everything that matches the pattern. 
Regular expressions are used most commonly for searches and text 
replacement. Ed’s regular expressions helped inspire later regex 
standards, such as POSIX. As the originator, though, ed does not 
trouble itself to support all the bloat of POSIX. These days, ed regular 
expressions are called “basic regular expressions.” You will find impure 
ed versions that have been distended by other regex versions but 
again, those are for the weak of mind and unworthy of our attention. 
Despite their reputation, looking at regular expressions does not cause 
incurable madness—that’s “extended” regexes you're thinking of. 
Before you can use any of ed’s amazing search and transformation 


features, you'll need to understand how to create regular expressions. 


Regular Expression Format 


Regular expressions commonly (but not always) appear between 
slashes. Ed’s documentation uses re as the sample regular expression, 
which would be shown as /re/. Between those slashes you'll see 
characters, wildcards, character classes, escaped characters, and 


modifiers. 
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Any single character matches itself. If you wanted to match the 
letters “t,” “h,” and “e,’ in that order, youd use a regex like /the/. 

Regexes use the period (.) as a wildcard. It matches any character. 

A character class appears in square brackets ({]). A class describes 
a bunch of characters that can appear in the expression. We'll cover 
those in detail in “Character Classes” later this chapter. 

An escaped character lets you include a character that has a special 
purpose within the regex. The most obvious example is if you need 
to include a slash in your regex. The slash is used to delimit a regex, 
so you can't do ///. You need to use a backslash (\) as an escape 
character, to tell ed that the character should be treated literally rather 
than as part of the expression. An escaped backslash would look like 
\/, and a regex to match a slash would become /\//. Searching for a 
period requires \., while a square bracket needs \{ or \}. Escaping can 
also transform a regular character into a special character, such as the 
braces used for counts and the parentheses needed by backreferences. 
When do you use which? It all makes perfect sense if you read 
the source code, but right now just nod and smile and follow the 
examples. 

Modifiers change how the regular expression gets interpreted. 
Suffixes are the most common regular expression modifiers, and 
appear after the final slash. Many of these modifiers are exactly like 
commands—for example, p tells the regex to print the match, while n 
prints the match and adds the line number. 

Let’s see how regular expressions work with a few highly artificial 


examples. 
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Searching in Ed 


All too often you know you used a word in a document, but you're not 
certain where it was. There’s no need to print out the whole file; ed lets 
you search for words by regular expression. Take this list of words— 
many similar, some not, printed with addresses so we can more easily 


discuss them. 


*in 

1 the 

2 their 

3 then 

4 Them 

5 there 

6 they’re 

7 thereafter 
8 2 

9 3weasels 
10 4 theremin 
11 /the/ 

12 pet rats 
13 [pedicular] 
14 hamster5 


Entering your regex as a command, slashes and all, tells ed to 
search for the next line containing text that matches the regex. I 


warned you this was artificial, so let’s search for the regex /the/. 


*/the/ 
the 


1 


The first line of the buffer matches. The search command set our 
address to the matching line. To search forward for the same regex, 


enter a double slash. 


wed 


their 
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Okay, that’s line 2. I could keep asking for the address, or I 
could add a command after the search command. Remember the n 


command that adds line numbers? It works as a search modifier. 


3 then 
5 there 


Hang on—what happened to line 4? Looking at the file, the word 
“Them” includes the. Shouldn't that match? No, it shouldn't. “Them” 
begins with a capital letter. The regex matches only a literal, lower 
case, the. You'll need to use a character class to make the search case- 
insensitive. 

Walking through a buffer one line at a time gets tedious, though. 
Use the g command in front of the search term to search for a regex 
globally, on the entire buffer. Put your regex right after the g. To print 
all the matches, add the trailing p command as a modifier. 

Here I globally search for the string “re,” or the regex /re/, and 
print the results. 

*g/re/p 
there 
they’re 


thereafter 
4 theremin 


The p command is the default action of a search, so you could skip 
it. Searches print their results automatically unless you say otherwise. 
But then this example wouldn't illustrate where the newcomer grep(1) 
command came from, would it? If you wanted to print line numbers 


while searching for “re” though, use the n command instead of p. 


*g/re/n 

5 there 

6 they’re 

7 thereafter 
10 4 theremin 
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Yes, grep has a command line flag to print line numbers—but why 
would you bother looking up that flag, when it’s conveniently built 
right into ed?° 

When you do a global search, your address is set to the last match. 

Search a subset of the buffer by putting addresses before the g 


command. 
*1,10g/re/n 


To search backwards from the current address, use question marks 


instead of slashes around the regex. 


*10 

4 theremin 

*?re?n 

7 thereafter 


The previous match is address 7. To repeat the current backward 
search, enter two question marks. 
*??n 
6 they’re 

The // and ?? commands both refer to the most recent regex, 


allowing you to move forward and backwards on the current search. 


Running Commands on Searches 


Commands to print the matching lines aren't the only things you can 
do with searches. You can give the search any command other than a 
search command (g, 6G, v, or v). 

An easy example is deleting matches. The a command deletes the 
current address. Here I strip away all lines that match the string “re” 
*g/re/d 

Your buffer just shrank. If you're following along at home, undo 
this before the next example. 

6 What Unix really needs is a gren(1) command. Because 
grep(1) gets lonely. 
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Say you want to move lines that match the regular expression 
to the end of the buffer. Use the m command, with the $ destination 
address that matches the end of the file. 


*g/re/m$ 
What does this give us? 
* ; n 
1 the 
2 their 
3 then 
4 Them 
5 2 
6 3weasels 
7 /the/ 
8 pet rats 
9 [pedicular] 
10 hamster5 
11 there 
12 they’re 
13 thereafter 
14 4 theremin 


Ed moved through the buffer. Every time it found a match, it 
moved that line to the end of the file. You just automatically re-ordered 
your buffer! Doing so with a simple regex like this looks trivial, but 
when you build more complicated regexes you'll find it useful. 

Suppose you wanted to move everything that matches to the 
beginning of the file. The command differs by only the destination 
address, but the results look far different. 
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*g/re/m0 

*in 

1 4 theremin 
2 thereafter 
3 they’re 

4 there 

5 the 

6 their 

7 then 

8 Them 

9 2 

10 3weasels 
11 /the/ 

14 hamster5 


The words that match our regular expression are at the top of the 
file, but their ordering has reversed. What was line 11 is now line 1, 
line 10 is now 2, and so on. In doing the global search, ed processes 
each line in order. Lines matching the regex get moved up to address 
zero. Line 5 matched, and got moved to zero. Line 6 then matched, 
and got moved to zero. Moving lines to the top of the stack—er, 
buffer—one at a time reverses their order. The same thing happens if 
you move lines to any address other than s, such as an address in the 
middle of the file. 


Avoiding this reversal requires interactive editing with a regex. 


Interactive Searching 


Sometimes you must perform specific actions on every matching line. 
Rather than forcing you to run a search and then a separate command 
on each line, ed can pause after each match and give you a chance to 
run a command. Use the G command to tell ed to pause after each 


match. 


*G/re/n 
5 there 
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I can hit ENTER to continue, or type a command. I’m going to use 
the m command to move this line to address 0, or the beginning of the 
file. 
mO 

Ed proceeds to show me the next match. I move each to a 


successive address at the beginning of the file. 


6 they’re 

ml 

7 thereafter 
m2 

10 4 theremin 
m3 


At the end of the search, ed displays the current address and the 


contents of that address. 


4 4 theremin 


What have I done here? I’ve moved every match to the beginning 


of the buffer, but retained the original ordering. 


*in 

1 there 

2 they’re 

3 thereafter 
4 4 theremin 
5 the 

6 their 

7 then 

8 Them 

9 2 

10 3weasels 
11 /the/ 

12 pet rats 
13 [pedicular] 
14 hamster5 


If you screw up this kind of operation, ed considers searches a 
single operation. You can undo even a complicated interactive search 


with u. 
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Inverted Matches 


If you want the search to hit on everything that doesn’t match 
your regex, use the v command. Here I display numbered lines of 
everything that doesn’t match “the” 


*v/the/n 

4 Them 

8 2 

9 3weasels 

12 pet rats 

13 [pedicular] 
14 hamster5 


Again, “Them” doesn’t match “the’—searches are case-sensitive. 
To interactively edit each non-matching line use the v command. 


This works exactly like interactive editing of matches with c. 


Character Classes 


Being able to match a string is fine, but any lowly WYSIWYG text 
editor can do that. What gives regular expressions power is their 
ability to describe types of text to match on. You want to match 
alphanumeric strings? Regex can do that. You want to hit any line that 
contains the numbers 3 through 5? No problem. All of these and more 
are built on character classes. 

A character class is a list of matching characters in square brackets. 
The class [rt] matches either a lower-case or upper-case letter T, 
while [Hn] matches either type of H. One appearance of a character 
class matches a single character in the regex. Character classes let us 


perform case-insensitive searches. 


*g/[Tt] [Hh] [Ee]/n 
1 the 

2 their 

3 then 

4 Them 
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Unlike searching on plain old /the/, this search picks up line 4 
with its leading capital T. 

You can also list ranges of characters, separated by hyphens. The 
class [a-z] matches every lower-case letter, while [a-z] matches upper 
case letters and [0-9] matches all digits. You can have multiple ranges 
in a single class, such as [a-za-z0-9]. If you need a class to include 
the hyphen put it first. The class -/{]] matches a hyphen, a slash, and 
either square bracket, while ,/-.]] matches whatever your system says 
the characters from / to [ are, as well as ]. 

Ranges can contain any characters you like. If I need to match lines 
containing the letters a through d, as well as u through z, lower case, 


my class would be [a-du-z]. 
*g/[a-du-z]/n 


6 they’re 

7 thereafter 
9 3weasels 

12 pet rats 

13 [pedicular] 
14 gelato5 


Use such classes to narrow in on exactly what you want, and an 


anchor to say where you want it. 


Anchors 

An anchor attaches a regex to a position in the line. They let you 
perform searches like “has a digit at the beginning of the line” or “ends 
with a Z? A caret (‘) anchors the regex to the front of the line, while 
the dollar sign ($) anchors it to the end. Here I search for all lines that 
start with the letter “t” 


*g/At/n 

the 

their 

then 

there 
they’re 
thereafter 


NOUWN 


44 


Chapter 101: Regular Expressions and Searches 


Similarly, you could search for every line that ends in a number. 


*g/[0-9]$/n 
8 2 


Anchors help you narrow down your searches. 


Inverted Classes 


Inverted classes contain all characters except those listed in the class. 
The first character in an inverted class must be a caret (*). A class like 
[*w] means “All characters except a lower case w.’ 

Searching based on inverse character classes quickly gets really 
tricky. If you want to use inverse classes, make your classes very short 
and simple. To search for everything that doesn’t include a character 
class, you're much better off using the v or v command. This search 


excludes everything that includes an upper case character. 
*v/[A-Z]/n 


Inverted classes are most useful inside a more complex regex. 
«9 


Suppose I want to match everything containing the letter “e,” so long 


as the character following it isn’t “t” 


*g/e[At]/n 

2 their 
3 Them 

4 then 


If you've been paying attention you'll notice that our first line, the, 
is missing. This contains the letter e, and it’s not followed by a letter t. 
Why doesn't it show up? The regex says we're looking for the letter “e” 
followed by any character other than “t” There’s no character following 
the “e” in the, so it doesn’t match. 

Note the difference in caret placement between the inverted class 
and the anchor. The inverted class uses a caret inside square brackets, 


while the anchor goes outside any square brackets. This lets you 
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perform searches like listing all the lines that start with a character 
other than “t.” 


*g/ATAt]/n 

4 Them 

8 2 

9 3weasels 
10 4 theremin 
11 /the/ 

12 pet rats 

13 [pedicular] 
14 gelato5 


You're probably better off using a command like v/*t/n, however. 
Regular expressions are tricky. Inverted classes are doubly tricky. 


Avoid them if you can. 


Multiple Matches and Wildcards 


Sometimes you'll want to match more than one of a character, or 
perhaps even zero or more of a character. Ed provides operators for 
this. 

Use the curly braces to match a character a certain number of 
times. Put the count directly after the character you want to match. 
A regex like e{2} would tell ed to match two “e” characters in a row. 
(You could also use a regex like ce, but that wouldn't let me set up 
the more complicated regexes we're about to see.) Normally, ed treats 
curly braces in a regex as characters to match. Escaping the braces 


tells ed to treat them special. This makes your expression a little more 


complicated. 
*g/e\{2\}/n 

15 thee 

16 theee 

17 theeeeee 
18 theeeeeen 
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We get four matches.’ Hang on a minute—there’s way more than 
two es in a row in lines 16, 17, and 18. What happened? This just 
demonstrates that ed is more detail-oriented than you are. Line 16 has 
two pairs of es in a row, with the middle e shared between them. Lines 
17 and 18 have three e-pairs in a row, plus more when you count the 
shared es. 

Matching a specific set of characters demands careful thought. 
What, exactly, do you want to match? I want to match two letters, 
exactly two letters, with something other than an “e” after them. Try 


using a character class that excludes e. 


*g/e\{2\} [Ae] /n 
18 theeeeeen 

That's definitely two es in a row, followed by a character that isn’t 
an e. 

Or suppose I want two es in a row after a character other than e, 
and to attach those es to the end of the line. 


*g/[AeJe\{2\}$/n 
15 thee 

Matching identical characters in a row demands you think 
carefully. 

Basic regular expressions let you match a range of characters. If 
you want to match, say, two to five es in a row, this looks like what 
youd want. Express this by giving the lower limit, a comma, and the 
upper limit, such as e{2,5}. You'll need to escape the braces in the 


actual ed command. 


7 These words didn’t show up in our list before? Weird. It’s 
almost like the author needed multiple identical letters in a row for 
this section, but didn't bother to go back and update any of the earlier 
examples. 
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*g/e\{2,5\}/n 
15 


thee 
16 theee 
17 theeeeee 
18 theeeeeen 


This hits the same problem as looking for two in a row. You'll need 


to add other characteristics to this regex to get the desired result. 


*g/e\{2,5\}$/n 


15 thee 
16 theee 
17 theeeeee 


This might look good at first glance, but if you count closely you'll 
see that line 17 has six es in a row. We need to set a boundary before it, 


using the not-e class. 


*g/[Ae]e\{2, 5\}$/n 
15 thee 
16 theee 

If you want to match anything that has a given number of 
characters or more, give the comma without an upper limit, as in (2, }. 
*g/e\i2,\3/p 
thee 
theee 
theeeeee 
theeeeeen 

In certain cases you'll want zero or more of a character. Indicate 
zero or more with an asterisk («). To get really crazy, combine this with 
the wildcard . that represents any character. 
*g/ . */n 


1 the 
2 their 


Yes, this matches every line in our file. Pretty useless, no? 
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The wildcard and asterisk are not useful on their own, but 
combined with other characters, classes, and anchors they become 
extremely powerful. Suppose you want to find all lines that start with a 
letter but end with a number. We've used regexes like *{a-za-z] to find 
letters at the front of a line. A regex like [0-9] s will find numbers at 
the end, no problem. But combining them gets tricky... until we allow 
other characters in the middle. 

*g/A[a-zA-Z] .*[0-9]$/n 
14 gelato5 

This allows you to declare you don't care what's in the middle or 
how long the string is. If it starts with a letter and ends with a number, 
you get it. The period represents zero or more of something, and the 
asterisk matches any character, so we're allowing zero or more of 
anything else. This lets us match really short strings like “al” without 
excluding longer strings. 

The asterisk can cause some counterintuitive output for folks 
accustomed to shell programming. You might expect a regex like 


*g/ther*/n to show words beginning with “ther,” but you'd be wrong. 


*g/ther*/n 

1 the 

2 their 
3 then 


The asterisk matches zero or more “r” characters at the end of the 
expression. The string “the” has zero “r” characters, so it matches. If 


you want one or more “r” characters, say so with a regex like rr*. 
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Commands and Searches 


One powerful feature of searches and regular expressions is that you 
can run commands after a search. Suppose you want to duplicate every 
line that matches a regular expression. As you'll be changing the file, 
start with a plain search to be sure your regex matches the stuff you 
think it'll match. 


*g/there/n 

5 there 

7 thereafter 
10 4 theremin 


That looks right. Now use the t command to copy the line, but use 
a destination address of “.”—the current line. This will insert the copy 
right after the current line.* Add the n command to print the affected 


lines. 


*g/there/t. n 


6 there 
9 thereafter 
13 4 theremin 


Note the line numbers. We've duplicated the matching lines on 
these new lines. 
This feature is most useful combined with substitutions, which 


well discuss next. 


8 Ed is not a primitive AI from Star Trek. You can't make its 
head explode via recursion or paradox—it duplicates each line only 
once and moves on. 
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While regular expressions let you search and manipulate matching 
lines, ed lets you change text based on those same regular expressions. 
Regex-based substitution is an incredibly powerful search and replace 
function that lesser text editors can only dream of. While clumsy 
regexes can shred your buffer, with practice you can quickly and easily 
transform your text. 

I’ve been swearing at—er, working with—regular expressions for 
decades. Even today, I do a search with a regular expression before 
performing a substitution based on that regex. Regular expressions 
are subtle and quick to anger. Always test them first. Even once I think 
they work, I keep a copy of the untransformed data. 

Use the s command for a substitution, followed by the regular 
expression and the new text. Separate each piece with a slash. 
s/regex/new/ 

We'll start with a simple substitution—but to achieve that, we'll 
start with a simple search. Using the same list of words as Chapter 
101, suppose we want to change all instances of “there” to “then.” First, 
search for your regular expression to see if you get what you think 


youre going to get. 
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*g/there/n 

5 there 

7 thereafter 
10 4 theremin 


Three matches. The search set our address to the last match, so let’s 
go back up to the top of the file and try a substitution. 
*1 


the 
*s/there/then/ 
? 


no match 


An error? But our search saw matching strings in the buffer! 
Remember, ed works on a line-by-line basis. Line 1 doesn’t match our 
regex, so it errors. You must use a search (either manual or automatic) 
to target substitutions. 

Substitutions allow you to programmatically correct errors. 


Consider one of the entries on our word list. 


*18 
theeeeeen 


“Theeeeeen” is not a word. The e key obviously stuck when 
someone typed “then.” We can use a substitution to fix it. Here, I 


replace any string of two or more es with a single e. 
*s/e\{2,\t/e/ 
Check the line now. 


*18 
then 


If I knew ahead of time I wanted to print the result, I could have 
added the p command at the end of the substitution. We'll cover the 


few specific commands that work at the end of a substitution. 


*s/e\{2,\}/e/p 
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If escaping all of those braces gives you trouble, you could choose 
the simpler option of repeating the substitution. The s command 


repeats the previous substitution. 


*s/ee/e/p 
theeeeen 
*s 
theeeen 
*s 
theeen 
*S 

theen 

*S 

then 

*s 

? 


no match 
You might notice that that the substitution only affects one match. 


Consider a line like this. 
*19 
WXXhXXaXXtXX XXiSXX this?XX 
I don't know how all those XX chunks got in there, but they need 


to go. You might try a substituting one of these strings with nothing. 


*s/XX//p 
whXXaXXtXX XXisXX this?XX 
The substitution engine does the least amount of work you require 
of it.’ It matches the first XX, between the w and the h, and yanks 
it out. You could tell it which of the matches to pull out by giving a 


number as an argument. Here, I substitute nothing for the seventh XX. 


*s/XX//7p 
WXXhXXaXXtXX XXiSXX this? 


9 Substitutions are suspiciously human in that regard. 
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It's useful to make minor tweaks to an existing line. The problem 
with such substitutions is that you have to count. It also makes 
repeating the substitution less useful. To repeat the last substitution, 
hit s again. 

*s/XX//p 

whXXaXXtXX XXisXX this?XX 
*S 

whaXXtXX XXisXX this?XX 
*S 

what is this? 

I could add a number after the s to repeat the substitution only on 
a specific match on the line. 

Eliminating all the matches is much simpler with the g (global) 
modifier. It tells ed to perform the substitution on every match it finds 


on the current line. I'll also add the p modifier to print the results. 


*s/XX//gp 
what is this? 


Problem solved. 


Combining Searches and Substitution 


Having to find the lines that contain the text you want to substitute 
would suck the joy out of ed. Rather than harsh your mellow, ed can 
combine searches and substitution. Give your substitution after the 
search, to say “if this matches, perform this substitution.” 

Our example tries to find the word “there” and replace it with 
“then.” The search is pretty straightforward: g/there/. Much as in the 
shell, a backslash means “continued on the next line.” I then give the 


desired substitution. 
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*g/there/ \ 
s/there/then/ 
test there 
thereafter 
4 theremin 

Ed prints the lines that match the search. But look at those lines 
now. 
* 5p 
test then 
* 7p 
thenafter 
* 10p 
4 thenmin 

Substitution success! Ed considers this combined search-and- 
substitute (or, as the kids would say, search and replace) a single 
command, so you could reverse it with u. Which I do, because this is 
a daft example. But you might use this in a script to update user login 
scripts, updating links to migrated servers, or any other tedious task. 
We'll see some examples in Chapter 111. 

The backslash that splits this operation into two lines isn't needed. 
If you put it all on one line, ed doesn’t show the lines that match the 


search. 


*g/there/ s/there/then/ 


You can have multiple substitutions in one command, but each 


substitution must appear on separate lines, separated by a backslash. 


*g/there/ s/there/then/\ 


s/t/T/n 

5 Test then 
7 Thenafter 
10 4 Thenmin 


This swaps out “then” for “there” and capitalizes the first T of each 


line. 
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While you can use apparently unrelated regexes inside your 
substitution, a failure of one substitution aborts all later substitutions. 
Arrange your substitutions carefully. 

Now that you've seen the complicated form, let’s simplify slightly. 
When you want to perform a substitution on the regex you're 
searching for, you can use the 2 address to perform the substitution on 


all available lines. 
*%s/there/Then/ 


Note this isn’t exactly the same as the more complicated 
substitution above, though; line 5 capitalizes “test” rather than “then” 


The full version is much more flexible, so well focus on that. 


Subexpressions and Backreferences 


A subexpression is a part of a regular expression. Use subexpressions to 
split a line into chunks, such as “everything before what we care about, 
what we care about, and everything after that.” Subexpressions are 
marked in parenthesis, such as ({a-za-z0-9]). Much like curly braces, 
you must backslash-escape parentheses to tell ed that you're using 
them for a subexpression. Otherwise, ed looks for literal parenthesis 
characters. While I’m introducing subexpressions, though, we'll skip 
the backslashes for clarity. 

For ed to remember the whole line, the subexpression must match 
the whole line that contains the string you want. The period and 
wildcards come in useful here. While /tne/ matches a line containing 
the string “the,” remembering the whole line requires using /.*the. */ 
—zero or more of something, what we really want, and then zero 
or more of something. Enclose the regex in parenthesis to have ed 
remember it as a subexpression: / (.*the.*) /. 

A backreference lets you refer to the literal text matched by a 


subexpression. You can use backreferences within searches, but they 
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overwhelmingly appear in substitutions. Ed recalls subexpressions by 
the order you define them in. Your first subexpression is backreference 
one, the second backreference two, and so on. Indicate a backreference 
with a backslash and its number, such as \1, \2, and so on. 

Our sample regex, /(.*the.*) /, tells ed to remember one 
subexpression. There's only a single backreference, \1. 

How would you use a subexpression and a backreference? You 
might want to search for every line that includes opening and closing 
quote marks. Human beings are notoriously inconsistent, so those 
strings might use either single or double quotes. I define a character 
class with both sorts of quotes, ['"]. Put that class inside parentheses 
to declare it a subexpression: (['"]). We could have any number 
of characters after that, up until the next appearance of whatever 
character matched the subexpression: .«. Whatever character we 
found in the subexpression is our first backreference, or \1. Put it 
all together into the regex (['"]) .*\1. I want to use this in a search 
command, so I'll need to backslash-escape the parenthesis to tell ed 
this is a subexpression: \(['"]\) .*\1. Use the g command to print the 


lines matching this regex. 


*Q/\CE'"I\) *\1/ 
"bleeeeep" 
unconscious 'people' 


Backreferences work with substitutions. Here I want to prefix every 
line that includes “the” with the string “HIT-” For your edification, I 
add the p command to print the results. I use a g command to target 
lines that match the regex /the/, so the substitution only happens 
on lines that contain that regex. The actual substitution uses a 
subexpression to capture the entire line, then a backreference to place 


that captured string in the regex. 
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*g/the/\ 
s/\C.*\)/HIT-\1/p 
the 

HIT-the 

their 

HIT-their 


The first line performs a search on our target regex, selecting lines 
that include “the” The command is broken into two lines, so ed prints 
the matching lines. The second line does the substitution, adding the 
string H1T- and then adding the \1 backreference to recall the first and 
only subexpression. I added the trailing p to print the results after the 
substitution, showing before and after. Each matching string has the 
desired prefix. 

Suppose you want a case-insensitive match on “the,” and 
you not only want to remember what you matched, you want to 
subdivide that memory into before, matching, and after chunks. 

Each piece needs its own parenthesis, giving you something like 


/(.*) ({tT] [hH] [eE]) (.*)/. 


This regex has three subexpressions, so you'll backreference them 
as \1, \2, and \3. 

How would you use this? Maybe you want to put “HERE->” and 
“<-HERE” around the match.'° Your substitution will need the first 


subexpression, then add a HERE->, the second subexpression, a <-HERE, 
and the last subexpression. 

The first line, the search, aims our substitution only at lines 
matching a case-insensitive “the.” The second line uses a regex 
with subexpressions to break that line into pieces, then performs 
a substitution with backreferences to glue the line together. Using 
this requires escaping all these parenthesis with backslashes, 


demonstrating why I skipped the backslashes while creating the regex. 


10 This is traditionally called “regex debugging.” 
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*g/[tT] LhH] [eE]/\ 

oy -*\)\CLtT] [hH] CeEJ\)\C. *\) /\LHERE->\2<-HERE\3/p 
tne 

HERE->the<-HERE 

their 

HERE->the<-HEREir 

then 

HERE->the<-HEREn 

Them 

HERE->The<-HEREm 


You can now see where your regex matched on each line, as well as 
the rest of the line. 
This particular substitution could be simplified with the s address, 


giving us: 


3 s/\(.*\)\([tT] [hH] [eB] \) \(.*\) /\LHERE->\2<-HERE\3/p. 


But that wouldn't let us use multiple substitutions in a single 


search. 


Multiple Substitutions 


You used a regex search to target your substitutions, but that doesn't 
mean you must use that regex in your substitution. Ed lets you stack 
multiple substitutions after a search, permitting complicated data 
transformations in a single command. 

Go back to Chapter 101 for a moment, where we tried to match 
strings containing two to five es in a row. That turns out to be difficult, 
because regexes can match multiple entries in one line. A string of 
six es is only two groups of three es, after all. Realistically, though, 
wed perform that search because we want to transform that data in 
some way. Stacking substitutions after a search can let you perform 
myriad transformations without the weakness of resorting to young 


whippersnapper tools like sed(1). 
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I have a text file containing occasional corrupt data. Words should 
contain either two or fewer es in a row, or greater than five. If a word 
contains strings of three, four, or five es in a row, those strings should 
be replaced with two es. If it has one, two, or six or more es in a row, I 
want to leave it alone. 

This takes careful thinking about our regular expressions. 
Identifying strings with a limited range of characters requires 
identifying what's on each side of the characters of interest. There’s 
basically three options: the es can be at the beginning or end of the 
word, or they can be in the middle. We'll need a regex and substitution 
for all three cases. Again, as we discuss the regexes Ill skip the 
backslash escapes for clarity. They'll go into the ed command line. 

For the first case, where the string of es are at the end of the line, 
use the not-e class ({*e]) to identify the beginning of the e-string. 
Then add the regex for three to five es: e(3,5}. Anchor the regex 
to the end of the line with s. That gives us a substitution regex of 
[*e]e{3,5}S. We want to replace this with whatever the not-e class 
matches, plus two es. Remembering what the not-e class matched 
requires a subexpression, so add the parenthesis around [~*e] to 
create ([*e])e{3,5}$. We replace this mess with the backreference 
and two es, or \1ece. The final substitution statement looks like 
s/([*e])e{3,5}$/\lee/. 

If the string of es are at the beginning of the line, we basically 
reverse the end-of-line regex. Use the caret (*) to anchor the regex to 
the front of the line. Add the e{3,5} regex for three to five es right after 
the caret, and use the not-e class in a subexpression: ( [*e]). That gives 
us a regex of *e{3,5} ([*e]). We want to replace this with two es, plus 
whatever the not-e subexpression matched, giving us ce\1. The final 


substitution statement looks like s/*e{3,5} ([*e]) /ee\1/. 


60 


Chapter 110: Substitution 


Having a string of es in the middle of the line steals elements from 
both of the previous regexes. We need subexpressions for not-e before 
and after the string of es, with the three-to-five es in the middle. That 
gives us ([*e]) e{3,5} ({*e]). With two subexpressions we need two 
backreferences in the substitution, or \1ee\2. The final substitution 
statement looks like s/([*e])e{3,5} ([*e]) /\lee\2/. 

Aim these substitutions with a search command. ‘The substitutions 
have no effect on strings of es longer than five characters, so we can 
run them on any pattern of three or more es: e{3, }. Our combined 
search-and-substitution command will look something like this— 


again, with backslash escapes omitted for clarity. 


g/e{3,3/ \ 
s/([Ae])e{3,5}$/\lee/p\ 
s/Ae{3,5}([Ae]) /ee\1/p\ 
s/([Ae]ef3, 5} ([Ae])/\lee\2/p 

I added the p (print) command to each of the substitutions, so that 
we can see which substitution triggers what change in each line. It’s 
classic printf-style debugging. As we used a backslash to separate the 
initial search and the first regex, ed will print the matching term. Each 
match produces four lines of output: the original line, the line after an 
e-in-front substitution, the line after the e-in-back substitution, and 
the line after the e-in-the-middle substitution. 

Adding the backslash-escapes in and running the command 
produces output much like so. 
* g/e\{3,\3/ \ 
s/\(TAe]\) e\ {3 , 5\}$/\1lee/p\ 
s/Ae\{3 , 5\F\CLAe]\) /ee\1/p\ 
s/\(TAe]\) e\f3, 5\F\CLAe]\) /\lee\2/p 
theee 
thee 


thee 
thee 
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Our first match has three es at the end. The first substitution 


catches and shortens it. 


theeeeee 
theeeeee 
theeeeee 
theeeeee 


The search catches a word with six consecutive es. None of the 


substitution regexes matched it, so it escapes unscathed. 


eeeeek! !! 
eeeeek! !! 
eek!!! 
eek!!! 
Here's a line beginning with five es. It gets whacked by substitution 


number two. 


bleeeeep 
bleeeeep 
bleeeeep 
bleep 


Unless you have a really long string of obscenities, two es is 
enough to bleep something out. The last substitution regex catches and 
trims it. 

All of this works beautifully, until someone points out that your 
regexes didn't include the special case of a line containing only three to 
five es. You'll add this substitution statement and rerun the command, 
exactly as you would with any other tool. We all have these moments. 
To perform complex multi-regex substitutions like this you're best off 


doing it in scripts, as discussed in Chapter 111. 


Underlining via Substitution 


The examples we've demonstrated have all been pretty simple. Let’s use 


them to build something more substantial. We'll use substitution and 
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plain text to underline every line that contains our regex.'' Yes, plain 
text doesn’t support underlining, but we're going to do it anyway, just 
the way Real Sysadmins did back in the day. 

We'll develop this iteratively, starting with the simple regex “the” 
Once we have a working proof of concept, we'll expand it. 


Our file contains a bunch of lines like this (shown with addresses). 


i the 
2 but then 
3 their 
4 Them 
5 4 theremin 
6 thebadgerthe 
7 thee 
Start by duplicating lines that match our regex, copying them in- 
place. 
*g/the/t. 
This gives us a buffer containing a bunch of duplicate lines. 
1 the 
2 the 
3 but then 
4 but then 
5 their 
6 their 


Remember that the t (transfer, or copy) command sets your 
address to that of the copied line. Each time this search hits a match, 
ed sets our address to the copied line. If you perform a substitution 
immediately after doing the copy, the substitution affects only the 
current line—the copy. To underline lines that match our regex, 


transform every character in a matching line to an equals sign. 


11 Because even Primordial Sysadmins had management that 
wanted pretty reports. 
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*g/the/t. \ 
s/./=/9 

*in 

1 the 

2 — 

3 but then 
4 —— 
5 their 
6 —— 
7 Them 
8 there 
9 ——— 


Lines that match “the” get a proper ASCII underline as Thompson 


and Ritchie intended, while non-matching lines do not. 


Partial Underlining 


Underlining is useful, but maybe you don’t want to underline the 
whole line, though. Perhaps you want to only underline the part of the 
line that matches the regex. That’s not much harder to do. We'll start 
with the underlining command and modify it. 

We need to break up the copied line into stuff that will become 
blank space and stuff that will become underlining. While there's 
many ways to handle this problem, my approach splits the line with a 
newline amidst the substitution. I removed the escape backslashes for 
clarity, because the remaining backslashes are important. 
$/(.*) Cehe) .*/x\1\ 

\2/ 

The regex part of the substitution includes two subexpressions. 
One is everything before our desired regex (.*). The second is the 
regex itself, (the). The regex has a third piece, .*, for everything else. 
The third piece is not a subexpression, so ed won't bother to remember 
it. (Strictly speaking the third part isn’t necessary, but I find it easier 
to explicitly say “there might be stuff after this” when reading my own 


code days or years later.) 
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Now consider the substitution part. If the search term appears 
first, wed have a blank first backreference. Later parts of this command 
assume we have something here, so we put a letter x at the beginning 
of the line no matter what. We have a backreference for the first part 
and jump to a newline. The newline isn't a “command continues on 
next line” statement; instead, it gets inserted into our substitution. Ed 
can tell the difference because the newline appears in the middle of a 
substitution. We're splitting the line into two lines. 

On the next line, the second backreference brings in the text we 


searched for. Combined with the earlier search, our buffer becomes 


something like this. 
1 the 

2 x 

3 the 

4 but then 
5 xbut 

6 then 


Everything that matches our regex gets busted up into groups of 
three lines. The first line in each group is the original line. The second 
is a copy of everything before the regex match, with an “x” prepended. 
The third is the string that matches the regex. Everything after the 
regex match is discarded. 

In our first group of three, line 2 contains only an “x” because 
there’s nothing before the regex match. Line 3 contains what matched 
our regex. 

In the second group of three (lines 4-6), line 5 shows what 
appeared before the regex match, but with a prepended “x.” Line 6 
shows the regex match. 

After each substitution, our address is set to the last line of the 
three. That’s the snipped-off chunk of text that matches our regex. 
Transform each of those characters into an equals sign with the really 
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simple substitution s/./=/g. We need the trailing g command so that 
every match gets transformed into an equals sign, not just the first one. 


This transforms our text like so. 


1 the 

2 x 

3 =— 

4 but then 
5 xbut 

6 —_ 


There's our equal-sign underlines, but they need appropriate 
spacing to put them under our terms. That’s why we saved the “stuff 
before the regex” lumps, the second line of each three-line group. 
Use relative addressing—the leading minus sign—to perform a 
substitution on the previous line, replacing every character with a 
space, as in: -s/./ /g. Working on a line changes our address to that 


line. This creates lines like so. 


al the 


then 


aun BRWN 
ion 
c 
ct 


The second line of each triplet looks empty, but contains blank 
spaces. Every line has at least one space in it, from the “x” used as a 
placeholder. That’s an extra character that will mess up our formatting, 
so use s/* //\ to remove it. There won't be any visible difference, but 
our next command will now work. 

The j (join) command defaults to connecting the following line to 
the end of the current line. The blank characters provide the spaces to 
have the equals-sign underlines line up. 

All together now! Add in the necessary backslash escapes before 


each of the parentheses and curly braces, and we can underline “the” 
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on each line. 


* g/the/t.\ 
s/\C.*\)\Cthe\) . */x\1\ 


but then 


their 


Them 
4 theremin 


thebadgerthe 


thee 


See? Microsoft Word has nothing on ed. NOTHING. 


Shortcuts and Alternates 


Ed is smart enough that it doesn't need to you to spell absolutely 


everything out. Consider the format of a search. 


g/regex/commands 
The final slash is there to separate the regex from any commands. 
If you're not running any extra commands, that last slash is 


unnecessary. You can drop it. 


*g/gela 
gelato 
In modern (mid-1980s and later) versions of ed, you don't even 
need the g command. A slash on its own triggers a search. 
“/\[.*icu 
[pedicular] 
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Similarly, the final slash on a substitution separates the substitution 
from any trailing commands. If you don’t have any trailing commands, 


you can ditch that final slash. 


*s/vim/ed 
ed 


If you want to replace the regex with nothing, as in s/regex//, you 
can shorten even further. Ed knows you want to do a substitution, 
because you used the s command. You've given it the regex and no 
substitution, so it just does it for you. This makes pulling surplus 
words out of a document easy. 

*s/emacs 

Searches and substitutions have a truly amazing number of 
backslashes. If you write a regex that looks for slashes, it starts to 
resemble the jaw of a Tyrannosaurus. Ed eases this problem by 
supporting alternate delimiters. When you use the s or g command, 
whatever character appears next takes the place of the slash. Here 
I want to search for slashes, so I use the letter “m” as an alternate 


delimiter. 


*gm/ 
/the/ 


Alternate delimiters also work in substitutions. Here I use “@” as a 


delimiter. 


*s@nanoGheresy 

Next I delete the first “h” from the current line, using “m” as a 
delimiter. 
*smh 

Making commands short is the Unix way. Alternate delimiters 
improve your ability to shorten commands. 

You now have a good grasp on how to use ed interactively. Before 
you go, though, let’s look at reusing ed commands through scripts. 
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Scripting is an utterly essential component of systems administration. 
Using ed in your scripts will not only simplify certain tasks, it 

will demonstrate your moral superiority over the lesser so-called 
sysadmins surrounding you. 

Consider our command to underline a regular expression on each 
line, from the last chapter. You really don't want to type all that every 
time you need to perform this transformation, do you? A reusable, 
editable script is much better. 

The key to scripting is to remember that ed is a command-driven 
line editor. Unlike lesser editors, it accepts instructions from standard 
input. It has no need to differentiate between human and script input. 
You can't include comments within your ed script, but you have a 


whole variety of ways to run your scripted ed. 


The Ed Command File 


The easiest way to script with ed is to create a file containing all the ed 
commands to run. and then just run it like so. 
$ ed textfile < ed-commands.ed 

The commands file contains only the ed commands you want to 
run. 

As when you're learning any scripting method, start simple. Let's 
create a script to duplicate every line that matches a regex. Open your 


editor and follow along. 
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$ ed dup.ed 

dup.ed: No such file or directory 
P 

* H 

cannot open input file 
* a 

g/the/t. 

sp 

Q 

*wq 

14 

$ 


We open an ed session and turn on prompts and detailed error 
messages. While ed doesn't care that the file doesn’t exist, enabling 
verbose errors commands it to remind us about the existing error. 

We then append to the buffer. The first command in the script is our 
friend g/the/t., straight from the last chapter. 

Once that search and copy finishes, the , p command prints the file 
from beginning to end. 

The final command in our script file is 0, to immediately quit 
without saving. 

The dot takes us back to our editing session's command line, letting 


us wq to save and quit. 


$ ed textfile < dup.ed 
179 
the 
the 


pony 
their 
their 


The first line of output is the number of bytes in the text file, just 
as if you were editing the text file interactively. We then see each line 
of the buffer, allowing us to verify that the command file does what we 
thought it did. 
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If you're happy with the results, edit the command file so that it 


will save the processed file. 


$ ed dup.ed 

14 

P 

*H 

* : n 

1 g/the/t. 
2 »p 

3 Q 


Consider the necessary changes. On line 3 you no longer want to 
quit unconditionally; rather, it’s time to save and quit. You also don't 
need to view the results of the transformation, so line two can get 
deleted. Start editing at the bottom, so that your line numbers remain 
consistent with the earlier printout as long as possible. 

*3s/Q/wq 
wq 

That replaces the 9 with wq. Ed displays the modified line. I could 
have also used the c (change) command, but a little regex practice 
never hurt anyone. Now kill line two, save, and quit. 

*2d 


*wq 
12 


Let’s see what the script shows now. 


$ ed textfile < dup.ed 
179 

266 

$ 


Ed read 176 bytes and wrote 266. Your text file has been changed— 
and best of all, by editing with ed, there's no possibility that a 
slipped arrow key would mess up the command that performed the 


transformation. All you changed was the save routine. 
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If you want to hide the bytes read and written, use ed’s -s flag. 

You want something more complicated? A command file that 
underlines the last match of a regex on a line, as we did in Chapter 
110, has one minor difference between the commands in the last 
chapter and what you need for a command script. See if you can spot 
it; 


g/the/t.\ 

S/\G *\)\Cthe\) .*/x\1\ 
\2/\ 

s/./=/9\ 

=8/./ /9\ 

s/A //\ 

j 

wq 


What's the difference? A script should automatically save its work, 
so I added wa. 
Adopting ed for routine use prevents many errors only possible in 


so-called “visual” editors. 


Ed in Shell Scripts 


While command files are fine for one-off jobs, sometimes you want to 
integrate ed into a more complicated workflow. While you could write 
a shell script that calls your command file, there’s no reason not to 
integrate those commands into the script. 

Suppose you have a whole bunch of web sites running Wordpress. 
(I know that you wouldn't serve your content in such an unseemly way, 
but most of your customers probably can’t handle wholesome, elegant 
technologies like Usenet.) You need to deploy an application firewall 
like Wordfence across the whole lot of them. Wordfence requires 
changes to each site’s . htaccess file. You could make a whole bunch 
of error-prone manual edits, or use ed to systematically and reliably 


make those edits for you. 
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First, have the script find all of the files you need to change. 
#!/bin/sh 


for x in /var/ww/*/.htaccess 
do 
Now use an echo statement to provide ed commands. Just to be 


sure, I set an address before inserting my new content before line 1. 


echo “1 


e 

# Wordfence WAF 

<Files “.user.ini’> 

<IfModule mod_authz_core.c> 
Require all denied 

</IfModule> 

<IfModule !mod_authz_core.c> 
Order deny,al low 
Deny from all 

</IfModule> 

</Files> 


# END Wordfence WAF 


q” | ed $x 
done 

At the very end, I call ed on the target file and send the contents of 
my echo statement into the command. 

That's it. We're done! 

You could use a “here” document instead, if you fear pedants 
whose sole reason for living is to shriek about unnecessary uses of 
cat(1) and echo(1). At least use an EOE (End of Ed) marker, though. 
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ed $x <<EOE 
id: 


4 
# Wordfence WAF 

<Files “.user.ini’> 
<IfModule mod_authz_core.c> 
Require all denied 
</IfModule> 

<IfModule !mod_authz_core.c> 
Order deny,al low 

Deny from all 

</IfModule> 

</Files> 

# END Wordfence WAF 

w 

q 

EOE 


If you've come this far, you now know more about ed than almost 


anyone who passes themselves off as a so-called sysadmin. Practice 


your ed. Develop skills. 


And the next time you're at a job interview where you need 


to demonstrate your skills by sharing your screen, establish your 


dominance early. Use ed. 
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Twas brillig, and the slithy toves 
Did gyre and gimble in the wabe: 
All mimsy were the borogoves, 


And the mome raths outgrabe. 


“Beware the Jabberwock, my son! 
The jaws that bite, the claws that catch! 
Beware the Jubjub bird, and shun 


The frumious Bandersnatch!” 


He took his vorpal sword in hand; 

Long time the manxome foe he sought— 
So rested he by the Tumtum tree 

And stood awhile in thought. 


And, as in uffish thought he stood, 
The Jabberwock, with eyes of flame, 
Came whiffling through the tulgey wood, 
And burbled as it came! 


One, two! One, two! And through and through 
The vorpal blade went snicker-snack! 

He left it dead, and with its head 
He went galumphing back. 


“And hast thou slain the Jabberwock? 
Come to my arms, my beamish boy! 
O frabjous day! Callooh! Callay!” 
He chortled in his joy. 


Twas brillig, and the slithy toves 
Did gyre and gimble in the wabe: 

All mimsy were the borogoves, 
And the mome raths outgrabe 
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